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Abstract. In a distributed clustering algorithm introduced by 
Coffman, Courtois, Gilbert and Piret pQ, each vertex of Z d receives 
, an initial amount of a resource, and, at each iteration, transfers 

(N ■ all of its resource to the neighboring vertex which currently holds 

the maximum amount of resource. In [3] it was shown that, if 
3 ' the distribution of the initial quantities of resource is invariant 

| under lattice translations, then the flow of resource at each vertex 

eventually stops almost surely, thus solving a problem posed in [5] . 
In this article we prove the existence of translation-invariant initial 
distributions for which resources nevertheless escape to infinity, in 
the sense that the the final amount of resource at a given vertex 
is strictly smaller in expectation than the initial amount. This 
answers a question posed in [4]. 
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1. Introduction 



1.1. Definitions and statement of the main result. Consider, for 



d > 1, the d- dimensional integer lattice. This is the graph with vertex 
. set Z d , and edge set comprising all pairs of vertices (x, y) (= (y,x)) 

with \x — y\ = 1. Here | • | denotes the 1-norm. We use the notation 
Z d for this graph as well as for its vertex set. It will be clear from the 



context which of the two is meant. 



The following model for 'distributed clustering' was introduced by Coff- 
man, Courtois, Gilbert and Piret pQ. To each vertex x of the lattice 
^ Z d , we assign a random nonnegative number Cq(x) G [0, oo] which we 

regard as the initial amount of a 'resource' placed at x at time 0. (The 
family (Co(x);x G Z d ) is not necessarily assumed independent). Then 
we define a discrete-time evolution in which, at each step, each vertex 
transfers its resource to the 'richest' neighbouring vertex. 

More precisely, the evolution is defined recursively as follows. Sup- 
pose that, at time n, the amount of resource at each vertex x is 
C n (x). Let N(x) = {y G Z d : \x — y\ < 1} be the neigh- 
bourhood of x (note that it includes x itself) and define M n (x) = 
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[y G N(x) : C n (y) = max zE N(x) C n (z)}. Now let v n (x) be a vertex 
chosen uniformly at random in M n (x), independently for each x, and 
take: 

, , _ J x, if C n (x) = 
a " W "\ v n (x), ifC n (x)>0. 

Finally, define 

C n+l (x):= C M- 
y-a n {y)=x 

For a fixed vertex x, the random variable Co(x) will be called the initial 
amount of resource at x, and the family [Cq{ e Z d ) will be called 
the initial configuration. Analogously, (C n ( € Z d ) will be called 
the configuration at time n. Note that a n (x) is the vertex to which 
the resources located at x at time n (if any) will be transferred during 
the (n + l)-th step of the evolution. We say that there is a tie in x 
at time n if C n (x) > and the cardinality of M n (x) is strictly greater 
than one. In case this occurs, a n (x) is chosen uniformly at random 
among the vertices around x that maximize C n . Note that, apart 
from those possible tie breaks, all the randomness is contained in the 
initial configuration. As soon as a vertex has zero resource, its resource 
remains zero forever. Also note that, when two or more vertices transfer 
their resources to the same vertex, these resources are added up. Thus 
this algorithm models a clustering process in the lattice starting from 
a disordered initial configuration. 

For a fixed vertex x, we use the notation C^x) for lim^oo C n (x) in 
case this limit exists. We write E for expectation with respect to the 
underlying probability measure. 

Our main result is the following theorem. The proof is given in Sec- 
tion [3J 

Theorem 1. Let d > 2. There exists a translation-invariant distri- 
bution for the initial configuration (Cq(x);x £ Z d ) such that, for each 

x e z d , 

E[C 00 (a;)] <E[C (x)]. (1) 

1.2. Background and motivation. Here is some more terminology. 
If, for all sufficiently large n, we have that a n (x) = x and a n (y) ^ x for 
all neighbours y of x, then we say that the flow at x terminates after 
finitely many steps. In that case, the limit Coo(x) is attained after 
finitely many iterations and will be called the final amount of resource 
at x. If for all sufficiently large n we have a n+1 (x) = a n (x), then we 
say that x eventually transfers its resource to the same fixed vertex. 

The following stability questions for this process (formulated here sim- 
ilarly as in [1]) have been investigated in the literature: 
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Question 1. Does each vertex eventually transfer its resource to the 
same fixed vertex almost surely? 

Question 2. Does the flow at each vertex terminate after finitely many 
steps almost surely? 

Question 3. If the answer to the previous question is affirmative, is the 
expected final amount of resource of a vertex equal to the expected 
initial amount? 

Of course the answers to the above questions may depend on the as- 
sumptions made about the distribution of the initial configuration. 
Note that if the answer to Question [2] is affirmative, then so is the 
answer to Question [TJ In that case, answering Question [3] is equivalent 
to answering the question whether the resource quantity that started 
on a given vertex will eventually stop moving almost surely. So, infor- 
mally, Question [2] is related to fixation while Question [3] is related to 
conservation. 

Van den Berg and Meester [2] considered the case d = 2 and i.i.d. ini- 
tial resource quantities. Using translation-invariance and symmetries 
of the system they proved that the answer to Question [T] is positive in 
the case that the initial quantities of resource have a continuous dis- 
tribution. They also showed that, if the resources are integer valued, 
then Question [2] has a positive answer as well. 

Later, van den Berg and Ermakov [3] considered again i.i.d. continu- 
ously distributed initial quantities of resource on the two-dimensional 
lattice. Using a percolation approach, they were able to relate Ques- 
tions [2] and [3] to a finite (but large) computation. By using Monte 
Carlo simulation, they obtained overwhelming evidence that the an- 
swer to these questions is positive for this case. 

In [1] it was proved that, for every dimension and every translation- 
invariant distribution of the initial configuration, the answer to Ques- 
tion [2] is positive. However, Question [3] was left open. Our Theorem [1] 
says that, for some initial distributions in this class, the answer to that 
question is negative. 

The conclusion of Theorem [1] is false for d — 1. To see that, suppose 
that the probability that the resource starting at the origin does not 
stop after finitely many steps is positive. Then, by translation invari- 
ance, there is, with positive probability, a positive density of vertices 
for which the initial resource will not stop after finite time. This im- 
plies that, with positive probability, there are infinitely many steps at 
which resource enters or leaves the origin, contradicting the fixation 
result of pE] mentioned in the previous paragraph. This argument can 
be generalized for example to any graph of the form ZxG, where G is a 
finite vertex-transitive graph. (For such graphs translation-invariance 
is replaced with automorphism-invariance). 
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In order to prove Theorem [TJ we will construct a random collection 
(forest) of one-ended trees, which is embedded in Z d , in a translation- 
invariant way, and then assign resource quantities to the vertices in 
such a way that, during the evolution, each resource follows the unique 
infinite self-avoiding path to infinity in the forest. In Section [2] we 
present a short discussion about the existence of certain random forests 
on Z d . In Section [31 Theorem [T] is proved. In Section H] we present some 
concluding remarks and open questions. 

2. Translation-invariant forests on 7L d 

Let G be an infinite graph. A forest of G is a subgraph of G that 
has no cycles. A tree is a connected forest. A subgraph spans G if it 
contains every vertex of G. A spanning forest (respectively tree) on G 
is a subgraph of G that is a forest (respectively a tree) and that spans 
G. The leaves of a forest T are the vertices of T that only have one 
neighbor in the forest. The number of ends of a tree is the number of 
distinct self-avoiding infinite paths starting from a given vertex. A tree 
is said to be one- ended if it has one end. 

We choose the (^-dimensional integer lattice as the underlying graph. 
For this choice, the literature provides several constructions of ran- 
dom spanning forests with translation-invariant distributions, for ex- 
ample, the uniform spanning tree [5], and the minimal spanning tree 
[B]. To be explicit, we briefly discuss one construction, based on the 
two-dimensional minimal spanning tree. 

Let E be the set of edges of the lattice Z 2 , and let {U e ; e G E} be a 
family of independent random variables distributed uniformly in the 
interval [0, 1]. For each cycle of the lattice, delete the edge having the 
maximum [/-value on the cycle. The resulting random graph is called 
(free) minimal spanning forest and is known to be almost surely a one- 
ended tree which is invariant and ergodic under lattice translations (see 

®)- 

For d > 2, we can use the two-dimensional minimal spanning forest to 
construct a random forest in Z d of which the distribution is invariant 
under lattice translations, and of which every component is one-ended. 
We regard Z d as Z 2 x 7L d ~ 2 and in each 'layer' Z 2 x {z} (where z runs 
over Z d_2 ) we embed an independent copy of the two-dimensional min- 
imal spanning tree T z . The resulting subgraph of Z d is a translation- 
invariant random spanning forest with one-ended components. This 
gives the following lemma. 

Lemma 2. For each d > 2 there exists a translation-invariant random 
spanning forest on 7L d , of which each connected component is one-ended 
almost surely. 
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Corollary 3. For each d > 2 there exists a translation-invariant ran- 
dom forest T on Z d , for which the following two properties hold almost 
surely. 

(i) Every connected component ofT is one-ended. 

(ii) Every edge of Z 2 of which both endpoints are in T is an edge ofT. 

Proof. Let H be a spanning forest as in Lemma [2] and write F for 
the set of its edges, and V for the set of its vertices. Let H be the 
forest with vertex set {2x : x G V} U {x + y : (x, y) G F} and edge 
set E = {(2x, x + y) : (x, y) G F}. Informally, H corresponds to the 
forest which is obtained when H is scaled up by a factor 2. Thus 
to each edge (x,y) of H, there correspond two edges, (2x,x + y) and 
(x + y,2y), in H. Note that if is a random forest which is invariant 
under translations of 2Z d , and which has the property that every pair 
x, y of its vertices satisfying \x — y\ — 1 is connected by an edge of H. 
To restore invariance under all translations of Z d , let W be a uniformly 
random element of the discrete cube {0, l} d , independent of H, and set 



In this section we fix d > 2. We will prove Theorem [T] by giving an 
explicit construction of an initial configuration (Cq(x),x G Z d ) whose 
distribution is translation-invariant and for which ([1]) holds. 

Let T be a random forest on Z d as given by Corollary [31 For vertices 
x and y of T we write x ~ y if (x, y) is an edge of T. We define a 
(random) partial order < on Z d by setting y < x if and only if x and 
are vertices of T and x belongs to the unique infinite self-avoiding path 
in T starting at y. If y < x we say that x is an ancestor of y and that 
y is a descendant of x. If y < x and x ~ y we say that x is a parent 
of y. Note that every vertex of T has a unique parent. Moreover, for 
every vertex x of T, exactly one vertex in {y : y ~ x} is the parent of 
x, and the others are descendents of x. 

We now define, for each x G Z d , the initial quantity of resource at x 



Note that, if x G T, then Cq(x) is the number of descendants of x. 
Since every connected component of T is one-ended, it follows from 
the definitions that this number is finite. Also note that, since the 
distribution of T is invariant under the translations of Z d , so is that of 
the family (C (x);x G Z d ). 



T = H + W. 



□ 



3. Proof of main result 



by: 




(2) 
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We now define a nested ('decreasing') sequence of forests that will be 
shown to describe the dynamics of resources when C (x) is given by (J2J). 
For a forest S, let (fi(S) denote the forest obtained from S by deleting 
all its leaves. Let T = T and, for n = 1,2,..., define inductively 
T n = 0(T n _i). 

The following observation follows easily from the definitions. 

Observation 4. Let y be a vertex of T, let x be the parent of y, and 
let n > 0. Then x is in T n+ i if and only if y is in T n . 

Lemma 5. For every vertex x inT, there is a finite index n (depend- 
ing on x) such that, for all n > n Q , x does not belong to T n . 

Proof. By Observation HI tiq(x) is at most 1 plus the number of descen- 
dents of x. As we mentioned before, this number is finite. □ 

Lemma 6. Suppose that, for all x, Cq(x) is given by (0|). Then for all 
n>0, 

c n (x)r^--y~^ Cn{y) > [ l xe ,l n; (3) 

I — U , if x ^ l n . 

Proof. We use induction on n. To verify ([3]) for n = we note that, if 
x belongs to T (= T) then, by ([2]), 

E^)=E E^j/H E i[z<x) = c {x)-i. 

y.yyx, y.yyx, z&d z€Z d \{x} 

y<x yS x 

Now, suppose that ([3]) holds for a given n. Since T was taken as in 
Corollary [3j two vertices of T n which are adjacent in 7L d must be linked 
by an edge of T n . By this and it follows that, for each vertex z of 
T n , a n (z) is the parent of z. Therefore, and because C n = outside 
T n , we have 

C n+1 (x) = ifx(£T n+1 ; (4) 

C n+1 {x) = ^ C n{v) itxe T n+1 . (5) 

y.y^x, y<x 

By applying (j^J) and Observation H] (and noting that (j^J) also holds 
for x G T n \ T n+ i, since then both sides of (j^J) are equal to 0), we get, 
for x e T n+ i, 



c n+ i{x) = c n(y) > E E c ^ = E C <+M- ( 6 ) 

y.y^x, y.y^x, z:z^y, y.y^x, 

y<x y<x z<y y<x 

Now (jlj) and (jSJ) complete the induction step, and the proof of Lemma 
El □ 
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Proof of Theorem^ Let the initial configuration be defined as in (j2J). 
Let x G By Lemma E] and Lemma we have that almost surely 
C n (x) = for all sufficiently large n. Hence C^ix) = almost surely. 
On the other hand, it is clear that Cq(x) > with positive probability, 
and hence E[C (x)] > 0. □ 

4. Concluding remarks and open problems 

At the end of the proof of Theorem [I] we mentioned the obvious fact 
that E[Co(x)] > for every x. It turns out that this expectation is 
even oo. Indeed, we have 

E[C (z)] = ^2F[y < x] = ^2F[x + y < x] 

ydT? y&H 2 

= ^2f[x < x - y] = ^2r[x < y] = oo, 

y£Z 2 yeZ 2 

where the second and forth equality follow by relabeling, the third 
equality follows by translation-invariance and the last inequality follows 
from the fact that x has infinitely many ancestors almost surely. 

We have not been able to construct an example where resources escape 
to infinity but the initial amount of resource at a given vertex has finite 
expectation. It is an interesting question whether such examples exist. 

In particular, in our construction, the initial configuration was chosen 
in such a way that, almost surely, the induced dynamics takes place in 
a forest with one-ended components, embedded in Z d and, at each step, 
the resources are transferred from every vertex with non-zero resource 
to its parent. It is not clear if for every initial configuration with these 
properties the expectation of the initial amount of resource of a vertex 
is infinite. We state these considerations more formally by the following 
two questions. 

Question 4. Suppose that (Co{x);x G Z d ) has a translation-invariant 
distribution and is positive exactly on the vertices of a forest with one- 
ended components. Furthermore, suppose that during the n-th step of 
the evolution, every vertex for which C„_i(x) > transfers its resource 
to its parent. Is it the case that E[C (x)] = oo? 

Question 5. Does there exist a translation-invariant distribution for the 
initial configuration for which E[C 00 (a;)] < E[C (a;)] < oo? 

A negative answer to Question H] would yield a positive answer to Ques- 
tion [5j 
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